Semi-Automatic Practical Ontology Construction by Using a Thesaurus, Computational Dictionaries, and Large Corpora

نویسندگان

  • Sin-Jae Kang
  • Jong-Hyeok Lee
چکیده

This paper presents the semi-automatic construction method of a practical ontology by using various resources. In order to acquire a reasonably practical ontology in a limited time and with less manpower, we extend the Kadokawa thesaurus by inserting additional semantic relations into its hierarchy, which are classified as case relations and other semantic relations. The former can be obtained by converting valency information and case frames from previously-built computational dictionaries used in machine translation. The latter can be acquired from concept co-occurrence information, which is extracted automatically from large corpora. The ontology stores rich semantic constraints among 1,110 concepts, and enables a natural language processing system to resolve semantic ambiguities by making inferences with the concept network of the ontology. In our practical machine translation system, our ontology-based word sense disambiguation method achieved an 8.7% improvement over methods which do not use an ontology for Korean translation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ontology-Based Word Sense Disambiguation by Using Semi-Automatically Constructed Ontology

This paper describes a method for disambiguating word senses by using semi-automatically constructed ontology. The ontology stores rich semantic constraints among 1,110 concepts, and enables a natural language processing system to resolve semantic ambiguities by making inferences with the concept network of the ontology. In order to acquire a reasonably practical ontology in limited time and wi...

متن کامل

Semi-Automatic Semantic Relations Extraction from Thai Noun Phrases for Ontology Learning

The critical issue in ontology construction is to extract concepts and identify ontological relations both in taxonomic and other semantic relations. In large and various domains, this task can be time-consuming and costly. In this paper, we propose the methodology to discover semantic relations embedded in Thai NPs in order to enrich the existing domain ontologies by using machine learning tec...

متن کامل

Study on Evolution of Domain Ontology

On the base of deep study of domain ontology’s evolution’s principle,gist,method and model , this paper reconstructs and uses such recognized ontology knowledge as professional thesaurus, professional dictionary and textbook and realizes knowledge collection and manipulation to build auto-learning system of depended text’s ontology finally to set up automatic construction of domain ontology’s c...

متن کامل

Automatic clustering of collocation for detecting practical sense boundary

This paper talks about the deciding practical sense boundary of homonymous words. The important problem in dictionaries or thesauri is the confusion of the sense boundary by each resource. This also becomes a bottleneck in the practical language processing systems. This paper proposes the method about discovering sense boundary using the collocation from the large corpora and the clustering met...

متن کامل

Automatic Thai Ontology Construction and Maintenance System

Ontology is an essential resource to enhance the performance of Information Processing system such as information integration, document classification in taxonomies, including information retrieval and data cleaning in database system. This paper proposes three methodologies for Automatic Thai Ontology Construction and Maintenance from technical corpus, dictionary and thesaurus. For corpus base...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001